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SHORT COMMUNICATION 


The region between the M and S genes of porcine 
haemagglutinating encephalomyelitis virus is highly similar to 


human coronavirus OC43 


Elke Vieler, T. Schlapp and W. Herbst 


Institut flr Hygiene und Infektionskrankheiten der Tiere der Justus-Liebig-Universitat, Giessen, Germany 


The nucleotide sequences of the regions between 
the membrane and spike protein genes of three 
strains of porcine haemagglutinating encephalo- 
myelitis virus (HEV) were determined. A total of 739 
(HEV strain 67N) and 751 (strains NT9 and VW57 2) 
nucleotides were sequenced. Two ORFs, potentially 
encoding proteins of 12:8 and 9-6kDa, were 
identified. Pairwise comparisons with the corre- 
sponding ORFs in bovine coronavirus (BCV) and 
human coronavirus (HCV) OC43 revealed sequence 
similarities of greater than 88:5 % at the nucleotide 
and 85:3 % at the amino acid level for the 12:8 kDa 
ORF product. For the 9-6 kDa ORF product simi- 
larities were greater than 96:9% and 95:2%, 
respectively. An additional 12 nucleotide deletion 
upstream of the 12-8 kDa ORF start codon was 
found in HEV 67N compared to NT9 and VW572. 
These results reveal a genomic organization of HEV 
in the region analysed that is homologous to HCV 
OC43 but different from BCV. 


Porcine haemagglutinating encephalomyelitis virus (HEV) 
is a member of the Coronaviridae. Coronaviruses are spherical, 
enveloped viruses with a capped and polyadenylated positive- 
sense ssRNA genome of 27 to 32 kb. Infected cells contain 
genomic size virus mRNA and six to eight subgenomic mRNA 
species (depending on the virus) that have a 3’-coterminal 
nested set structure. A common 5’ leader sequence, derived 
from the 5’ end of the genomic RNA, is present in each mRNA 


Author for correspondence: Elke Vieler. Present address: Retrovirus 
Laboratory, Division of Comparative Medicine, Johns Hopkins University 
School of Medicine, 720 Rutland Avenue/Traylor G-60, Baltimore, MD 
21205, USA. 

Fax +1 410 955 9823. e-mail VIELER@WELCHLINK.WELCH.JHU.EDU 


The nucleotide sequence data reported in this paper have been 
deposited with the EMBL nucleotide sequence database under 
accession numbers X89861, X89862 and X89863. 


0001-3876 © 1996 SGM 


species. There are several different models for coronavirus 
mRNA synthesis. The leader-primed transcription model 
proposes interaction of the leader sequence with the full-length 
minus-strand RNA at the consensus intergenic sequence to 
initiate synthesis of subgenomic plus-strand RNAs. In contrast, 
the model of discontinuous transcription during negative- 
strand RNA synthesis suggests subgenomic minus-strand 
RNAs, containing an antileader sequence at the 3’ end, serve as 
templates for transcription of plus-strand subgenomic RNAs 
(Holmes & Lai, 1996). 

HEV belongs to the antigenic group II of coronaviruses, 
together with bovine coronavirus (BCV), human coronavirus 
(HCV) strain OC43, turkey coronavirus (TCV) and mouse 
hepatitis virus (MHV). Members of this group possess five 
structural proteins, nucleocapsid (N), membrane (M), small 
membrane (sM), spike (S) and haemagglutinin—esterase (HE), 
which are encoded by mRNAs 7, 6, 5-1, 3 and 2:1, respectively. 
Additionally, ORFs potentially encoding non-structural (ns) 
proteins have been identified. Their number and sizes differ 
between coronavirus species (Spaan ef al., 1988). Comparing 
BCV and HCV 0C43, the only major genomic difference was 
found in the region between the M and S genes. While in BCV 
four ORFs were located in this region, potentially encoding 
proteins of 4:9 and 4°8 (mRNA 4), 12:7 (mRNA 5) and 9°5 kDa 
(mRNA 5:1), in HCV OC43 only two ORFs, encoding proteins 
of 12°9 (mRNA 5) and 9°5 kDa (mRNA 5:1), were identified 
(Abraham et al., 1990; Mounir & Talbot, 1993 a). To date, it has 
not been determined if the 4:9 and 4:8 kDa ns proteins are 
expressed in BCV-infected cells. The 9:5 kDa proteins of BCV 
and HCV OC43, in contrast, have been shown to be expressed 
(Abraham ef al., 1990; Mounir & Talbot, 1993 a). The functions 
of the small ns proteins are not known. However, it has been 
proposed that they may act as an anchoring region for 
structural proteins during virus assembly or that they may 
maintain membrane association of the virus polymerase during 
replication (Boursnell & Brown, 1984; Boursnell ef al, 1985; 
Leibowitz et al., 1988; Skinner & Siddell, 1985; Skinner et al., 
1985). 

Among the haemagglutinating coronaviruses the genomes 
of MHV, BCV and HCV OC43 have been sequenced to the 
most part. Comparisons of the predicted amino acid sequences 
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-.:S —> Peptide —> 
HEV-NT9 UGUUGUGAUGAUUAUACUGGACACCAGGAGUUUGUAAUCAAAACUUCACAUGACGAUUAAGUUCGUCUUUGUUUUACUGACUCUUGACCAUAUAGUAACU 100 
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12.8 kDa —> 
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HEVAVWS72)-  astiod epiersinete-s teeteiials eo eee Boos CAUsry caress ure eee ono Creve Gna eo ahs Raised litany ease bie he parr te RN E 200 
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xk 
HEV-NT9 AACUAGAAGAUGUCUGUUUUAAAUAUAA CUAUCAAUUUCCCAAAGUAGGAUAUUGUAGAGUUC CUAAUUAUGCUUGGUGCCGUAACCAAGGUAGUUUUUG 300 
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HEV-NT9 UGCUACAUUUACUUUGUACGGCAAAUCCAAACAUUAUGAUAAAUAUUUUGGAAUAAUAACUGGUUUCACAGCAUUCUCUAAUUCUUUAGAAGAAGCUGUU 400 
HEV-VW572 cee eee eee DG She ceenn Siler a Ave Sy Bete oulSs, e850 Se SEMEN aah 'g Sueert 1a id SD Aatiah 98 8d 50Ke Sys Diaisectas estas Sores ¢ Green asese cease 400 
HEVIGING ©0000 see Gee epdversre: cieice whole se Sees, eseNe nw le tars AS) opens! Spareparers uscatin wists aid feb ace wis vere tennis WS Dranene st eireva a Tap eat gt 8 cay ote a ia ace “aos op ws sy ayer ner 388 
9.6 kDa —> 
HEV-NT9 AAUAAACUGGUUUUUUUAGCUGUUGACUUCAUUACCUGGCGAAGCCAGAGUUUAAAUGUUUAUGGCUGAUG CUUAUCUUGCAGACACUGUGUGGUAUGUG 500 
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HEV-NT9 GGGCAAAUAAUUUUUAUAGUUGCCAUUUGUUUAUUGGUUAUAAUAGUUGUAGUGG CAUUUUUGGCAACUUUUAAAUUGUGUAUUCAACUUUGCGGUAUGU 600 
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M —> 
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Fig. 1. Comparison of the nucleotide sequences between the M and S genes of HEV NT9, VW572 and 67N. The sequence of 
HEV NT9 is written in full. Positions having identical nucleotides are marked with dots, positions of deleted nucleotides are 
marked with dashes. Consensus intergenic sequences are underlined. Asterisks indicate stop codons. 


of the structural and ns proteins of BCV and HCV OC43 
revealed more than 91% identity (Cox et al., 1989; Kamahora 
et al., 1989; Kiinkel & Herrler, 1993 ; Labonté ef al., 1995; Lapps 
et al., 1987; Mounir & Talbot, 1992, 19934, b; Zhang ef al., 
19914, b, 1992). In contrast, no sequence data for HEV, except 
for a small portion of the S gene, have been published (Vieler 
et al, 1995). This short nucleotide sequence as well as 
hybridization studies using BCV S gene-specific probes 
indicated a close genomic relationship to BCV and HCV 
OC43. 

Since the only major genomic difference between BCV and 
HCV OC43 identified so far was found in the region between 
the M and S genes it was of interest to study this genomic 
region in the closely related HEV. We report here the sequence 
between the S and M genes of three HEV strains. Only two 
ORFs, with the potential to encode proteins of 12:8 and 
9-6 kDa, were identified in the HEV genome, a situation similar 
to HCV OC43 and different from BCV. 

The origin and cultivation of HEV strains 67N, VW572 and 
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NT9, BCV L9 and HCV OC43 as well as the HRT-18 cell line 
have been described previously (Vieler et al., 1995). Total RNA 
for RT-PCR was isolated from virus-infected cells by phenol— 
chloroform extraction followed by ethanol precipitation. 
Single-stranded cDNA was synthesized at 47:5 °C using 
SuperScript Plus RNase H™ reverse transcriptase with primer 
6341 (5° AACGTCATCCACATCAAGAAC 3’) correspond- 
ing to nucleotides 6321-6341 of the BCV 9:5 kDa sM protein 
(EMBL accession number M30612). cDNA for preparation of 
the N gene probe used for Northern blot hybridization was 
synthesized basically under the same conditions but using 
primer 3’-N (5° CTCCTGGTAAGCAATCCAGT 3’). The 
digoxigenin-labelled probe (DIG dUTP; Boehringer Mann- 
heim), corresponding to nucleotides 827-1376 of the BCV N 
gene (EMBL M16620), was prepared by PCR from BCV L9 
cDNA with primers 3’-N and 5’-N (5° TAGTAACCCT 
GAGGGAGTAC 3’). For sequencing of the region between 
the M and S genes, DNA was prepared by PCR using primers 
6341 and 5327 (5° ATGTGGTGGTTGTTGTGATGA 3’). 


(a) 


HEV-NT9 MD IWCPEKKYLRYTNGFNVSELEDVCFKYNYOF PKVGYCRV PNYAWRCRNOGS FCAT FTLYGKSKHYDKY FGI ITGFTAFSNSLEEAVNKLV FLAVDF I TWRSQSLNVYGX 1039 
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BCV-Mebus Bre Ss fay: Serre ere Wags baits Bide BreraKes 5 3.5 sysvaraies SH3 Veeck ee @ Lider toi9 8a RSs OM PS OS Pi TV. yess etek Ses ee R E...... 109 
(b) 
HEV-NT9 MFMADAYLADTVWYVGQIIFIVAICLLVI IVVVAFLATFKLCIQLCGMCNTLVLSPS I YVFNRGROFYEFYNEVKPPVLDVDDVX 84 
HEV-VW5S72 ww wwe ee PIG is ic faifoites Sous eek dese Ane aby SS See aca SSG. Salsas ee aS ie ge al opis wy ata e WAN ROR Ere 5S LER SASS 84 
HEVSGIN- (§ cbigjiceogishey dl sik MEE OGLE W wre ees She eae iaN Tank eer eetnecenclal e354 Balle says gf ousteliayelsteGwteey severely Diener. fecsisesiten oreee 84 
HOV=OC43: eb aiden ees de oe ROMS eles eee SD iveseh cane ol 2 dicta Gace ayo io a Ps seh ee ee ORNS nk Suwa ateite eves Digssislochids ete, ace.’ 84 
BCV-Mebus.......- Bingo ier es Gute oa ace ate guysnigua eee See Seana aed ertoearaheehy. wee eas Gore ee SoS SR Rew ete ore eae 84 
(c) 
HEV-NT9 MTIKFVFVLLTLDHIVTLSICQSFX 24 
HEV-VWS72 a plies ¢D ee i-0 Dicsare-asecs a ayo x 24 
HEV-67N oe N...----GF. we ee x 20 
HCV-0C43 ..T....DS.HX 11 
BCV-Mebus ..T....D..AP.D.LHPFNHVKLIIRPIEVEHI IIATIMPAVX 43 


Fig. 2. Alignment of the predicted amino acid sequences of the 12°8 kDa ORF product (a), the 9-6 kDa ORF product (b) and 
the 24 and 20 aa peptide (c) of HEV strains NT9, VWW572 and 67N compared to the corresponding sequences in HCV OC43 
(Mounir & Talbot, 1993) and BCV Mebus (Abraham et a/., 1990). The sequence of HEV NT9 is written in full. Only amino acid 
differences are indicated for the other coronavirus strains. Identical amino acid residues are marked with dots. The potential N- 
glycosylation site in the 12-8 kDa ORF product is indicated by an asterisk. 


The sequencing of both strands was performed as previously 
described (Vieler et al. 1995) using the additional primers 
HEV-NS1 (5° CTTGGTGCCGTAACCAA 3’), HEV-NS2 (5’ 
CTCTACAATATCCTACTTT 3’), HEV-NS3 (5° GGTTTCA 
CAGCATTCTCT 3’), HEV-NS4 (5° AGAGAATGCTGTG 
AAACC 3’), HEV-NS5 (5° GACACTGTGTGGTATGT 3’), 
HEV-NS6 (5° ACATACCAACAGTGTC 3’), HEV-NS7 (5’ 
ATAGCTTCATCAGCAGT 3’) and HEV-NS8 (5° GTGTAT 
TCAACTTTGCG 3’). The DNASTAR software package was 
used for DNA sequence analysis. For Northern blot analysis, 
poly(A)-RNA was isolated from virus-infected and mock- 
infected cells 16 h post-infection using the QuickPrep Micro 
mRNA Purification Kit (Pharmacia) according to the manu- 
facturer’s instructions. Subgenomic RNAs were separated on a 
08% agarose gel containing 2% formaldehyde and were 
transferred to a nylon membrane (Biodyne B; Pall). Prehybridi- 
zation and hybridization were done at 50 °C using the DIG 
Luminescent Detection kit for nucleic acids (Boehringer 
Mannheim) as recommended by the manufacturer. After 
hybridization, blots were washed twice for 5 min in 2 x SSC 
and 0°1% SDS at room temperature and then twice for 15 min 
at 75 °C in 0-5 x SSC and 0°1% SDS. The hybridization signals 
were immunologically detected according to the manu- 
facturer’s protocol. 


As shown in Fig. 1, the region between the S and M genes 
of HEV 67N, NT9 and VW572 contained two ORFs 
potentially encoding proteins of 12°8 and 9:6kDa. The 
predicted amino acid (aa) sequences of these ORFs are shown 
in Fig. 2. Additionally, an ORF coding for a 24 aa (HEV NT9 
and VW572) and 20 aa (67N) protein was observed upstream 
of the 12°8 kDa ORF beginning at nucleotide (nt) 50. 

The 12°8kDa ORF (nt 140-469 in NT9 and VW572, 
128-457 in 67N) potentially encodes a protein of 109 aa with 
a molecular mass of 12°82 (NT9 and VW572) or 12°77 kDa 
{67N). Nucleotide and amino acid sequence alignments with 
the corresponding sequences in BCV Mebus (Abraham et al., 
1990) and HCV OC43 (Mounir & Talbot, 1993.) revealed 
identities of more than 88-5 % at the nucleotide and more than 
85°3% at the amino acid level. As in BCV and HCV OC43 a 
potential N-linked glycosylation site was located at aa position 
18. The putative initiation codon with its surrounding context 
(UUAAUGG) and the consensus intergenic sequence (UCAA 
AAC) were identical to those found for HCV OC43. The 
distance between the initiation codon and the consensus 
intergenic sequence was 101 nt in HEV NT9 and VW572, and 
89 nt in 67N, while in HCV OC43 this distance is 109 nt. The 
shorter distance in HEV 67N compared to NT9 and VW572 
results from an additional 12 nt deletion upstream of the 
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Fig. 3. Northern blot analysis of HEV 67N and BCV L9 subgenomic 
mRNAs. Poly(A)—mRNA from HEV 67N- and BCV L9-infected cells and 
from uninfected cells (control) was hybridized with a DNA probe 
corresponding to nucleotides 827-1376 of the BCV N gene (Lapps et ai., 
1987). The subgenomic mRNA species are indicated. 


12°8 kDa start codon (between nt 75 and 88 in NT9 and 
VW572; Fig. 1). 

The 9:6 kDa ORF (nt 456-710 in HEV NT9 and VW572, 
444-698 in 67N) is predicted to encode an 84 aa protein with 
a molecular mass of 9°55 kDa in NT9, 9:56 kDa in VW572 and 
9°58 kDa in 67N. Similarities with the corresponding regions 
and their products in BCV and HCV OC43 are more than 
969% at the nucleotide and more than 95-2% at the amino 
acid level. The UCCAAAC consensus intergenic sequence 
begins 129 nt upstream of the first potential start codon, which 
is in the context UAAAUGU. These features correspond to 
those reported for BCV and HCV OC43 (Abraham et al., 1990; 
Mounir & Talbot, 1993). 

Detailed analysis of the amino acid sequences of the 
predicted proteins using the Clustal method (data not shown; 
DNASTAR software package) revealed a closer relationship 
between BCV and HCV OC43 than between either of them 
and the three HEV strains. The only exception was the 9:6 kDa 
sM protein of HEV 67N, which was more closely related to 
HCV OC43 than to all other coronaviruses tested. 

The 24 and 20 aa-encoding ORFs (nt 50-121 and 50-109), 
respectively, encode predicted peptides with molecular masses 
of 2°76 (HEV NT9), 2:75 (VW572) and 2:30 kDa (67N). Since 
only 645 nt of the HEV S2 gene were sequenced (Vieler et al., 
1995) the search for the consensus intergenic sequence of these 
peptides was restricted to this fragment. However, at the 
position where the consensus intergenic sequence UCAAAC 
(nt 3760-3765, Mounir & Talbot, 1993 b) was found for the 
HCV OC43 peptide and the corresponding consensus se- 
quence for the BCV Mebus 4:9 kDa ns protein gene (CCA 
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AAC; nt 1-6, Abraham ef al., 1990), the sequence CCAAUC 
(nt 394-399; Vieler et al., 1995) was found for HEV 67N. No 
other potential consensus sequence was found within the HEV 
67N S2 gene fragment. Alignments of the first 10 aa of the 
predicted peptides revealed a high similarity with the 11 codon 
ORF in HCV OC43 and with the 3’ end of the ORF encoding 
the 4-9 kDa protein in BCV Mebus (Fig. 2). 

Using Northern blot analysis (Fig. 3), HEV 67N and BCV 
L9 mRNAs 7, 6, 5-1 and 5, encoding the N , M , sM and 
12:8 kDa ns proteins, showed similar size. mRNA 3, 2°1 and 2, 
encoding the S, HE and NS2 proteins, migrated more slowly in 
BCV L9 than the corresponding mRNA species in HEV 67N. 
The subgenomic RNA potentially encoding the 4:9 and 
4:8 kDa ns proteins in BCV (mRNA 4) was visible as a minor 
band but was absent in HEV 67N RNA. Similar results were 
obtained for HEV NT9 and VW572 (data not shown). Mounir 
& Talbot (1993 a) reported that despite the lack of these two 
ORFs in HCV OC43, small amounts of mRNA 4 were present 
in HCV OC43-infected cells. In this study, however, mRNA 4 
could not be detected in any of the three HEV strains tested. It 
seems possible that mRNA 4 in HEV might be obscured by 
mRNA 5, since it is at least 290 nt smaller than BCV mRNA 4. 
Another reason may be that only very small amounts of this 
mRNA species were produced in the HEV-infected HRT-18 
cells. 

Since BCV, HEV and HCV OC43 all replicate well in HRT- 
18 cells, although the 4:9 and 4:8 kDa ns protein-encoding 
genes were absent in HEV and HCV OC43, it appears that 
these putative proteins are not essential for virus replication. 

It has been suggested that the absence of these two genes 
in HCV OC43, together with differences in the S protein, 
might be involved in the apparently preferential respiratory 
tropism of this virus (Mounir & Talbot, 1993). However, 
HEV has a strong tropism for neurons in experimentally 
infected mice and rats and in naturally infected piglets, but 
actively replicates in the respiratory tract before spreading to 
the nervous system (Fenner ef al., 1993; Hirano et al., 1993; 
Yagami ef al., 1993), indicating that other mechanisms than the 
absence of these genes are involved in tissue tropism. 
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